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Power Sensors 


Initial support in 4.6 

Improved to be useful in 4.9 

Exposed through hwmon 

Based on Martins initial work 

Supported on mid+-end GPUs starting with Fermi 
Can hint at bad GPU utilization within mesa 

e Should be moved into the PMU 


Static Power Consumption - Clock Gating 
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Up to 25% power savings 
Benefits some high loads 
WIP for Fermi, Kepler and Maxwell 


doesn’t work without secboot 


Static Power Consumption - Power Gating 


e Might be able to close the gap 
e Use counting of engines already in place 
o Needs a lot of REing 


GPU Boost - Introduction 


Advances DVFS 

1.0: increase clocks until reaching power target 

2.0: increase clocks until reaching temperature target 
Added officially with Kepler 


Already some bits on Fermi 


Mostly implemented in the kernel driver 


GPU Boost - How it (really) works 


o Limiting the voltage on high temperatures 
e Information defined in a bunch of VBIOS tables 


GPU Boost - Voltage 


ə VID table 

ə How to voltage (GPIOs, PWM) 

ə What voltages can be set (ranges, step sizes) 
e VMAP table 

e Voltage calculations depending on 


o T: Temperature 
e S: Individual chip calibration factor (Speedo) 


ə Defines “max” voltage entries 
a*S a Cox S? 


e volto = 5 + 


10 10° 
Cox 55 ci: Sx50 Cox Tx56 Cax Sx T 5° ca S2*5° cs* T?x56 
o volti = DIS 1518 Fe 310 PE 318 pa 330 p3 DIE 
o volts = min 
ə volt; = min-max 


o CSTEP table 


e Voltage requirements for a C-State 


GPU Boost - Clocking 


ə CSTEP table 
o List of GPC clocks (C-States) 
ə Defines max C-State of P-States 
e PMode table 
e “Default” clocks for each engine 
ə BOOST table 
e Clock adjustments for Engines (eg. 90% of GPC clock) 
ə Min/Max clocks for each P-State 
e VPSTATE table 


ə Actually (partly) documented! 
ə Specifies Base clocks (confirmed) 
e ... and Boost clocks 


GPU Boost - Example: GTX 780 Ti - speedo 0x610 
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GPU Boost - Example: GTX 780 Ti - speedo 0x691 
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GPU Boost - nv. cmp. volt 


Reverse engineering tool 

Reads out hardware state 

Doesn't interact with the NVIDIA driver 

Verifies Nouveau voltage algorithm against NVIDIA 


Demo later 


GPU Boost - Solved problems 


Didn't respect Voltage limits 
Tried to perfectly match voltage 


Missing voltage calculation 


No updates on temperature changes (WIP) 


State of reclocking - What is in a good state? 


Hardware thermal protection up to GM10x 

Fan management up to GM10x 

Engine reclocking for most cards up to Maxwell 

Memory reclocking for Tesla (G92+), Kepler and Maxwell 
PCle link configuring 

GPU Boost (partly in 4.9) 


State of reclocking - Work in Progess 


PMU engine load counters for GT215+ 
Memory reclocking on Fermi (Roy Spliet) 


Software thermal protection 


Power budgets 


State of reclocking - What is missing? 


Clock read out while hardware thermal protected 
Maxwell2 fan control (signed PMU firmware required) 
Advanced GK110+ engine reclocking 

Hi-PLL Fermi+ engine reclocking 

Linebuffer/isohub? (flicker free memory reclocking) 
New fan control VBIOS table for Kepler+ 

Memory reclocking on G80-G86 


Pascal is completly different 


State of reclocking - DVFS 


e Policies 

e What is usable in general? 

e How to deal with spiky loads? 
e PMU notifies host about high/low loads 
e Various benchmarks needed 


ə Performance 
o Interactive (Desktop environment) 
e Web browsers 


o Prototype! 
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Demos 


e nv_cmp_ volt 
e DVFS 


